Speaker Localisation Using the Far-field Srp-phat in Conference Telephony
نویسندگان
چکیده
This paper describes a robust algorithm for sound source localization in conference rooms. The method used is a modified steered response power phase alignment transform algorithm. The results are obtained by processing real data recorded in a typical conference room, and they are compared to data obtained from a simple free-field model. The algorithm demonstrates good accuracy for finding the correct angle of arrival for the dominant speaker in the room and works well for speech sources. The algorithm integrates well with subband decomposition and is suited for real-time applications.
منابع مشابه
Speaker Localization , tracking and remote speech pickup in a conference room
Effective speech communication using microphone Array is getting significant research in speech acquisition methods such as speaker localization and tracking. Localization techniques play an important role for automatic camera in videoconference system and for other human machine interfaces. To locate the accurate Direction Of Arrival (DOA) from the source, it is necessary to design a suitable ...
متن کاملCalibration errors of uniform linear sensor arrays for DOA estimation: an analysis with SRP-PHAT
This article presents an analysis of the sensitivity of geometrical sensor errors in acoustic source localization using the well-established SRP-PHAT method. The array in this analysis is a uniform linear array and the intended source is human speech in the far field. Two major results are presented: inner-sensor geometrical errors in the linear array produce smaller localization errors than co...
متن کاملVoice activity detection and speaker localization using audiovisual cues
This paper proposes a multimodal approach to distinguish silence from speech situations, and to identify the location of the active speaker in the latter case. In our approach, a video camera is used to track the faces of the participants, and a microphone array is used to estimate the Sound Source Location (SSL) using the Steered Response Power with the phase transform (SRP-PHAT) method. The a...
متن کاملExperimental evaluation of multi-band position-pitch estimation (m-popi) algorithm for multi-speaker localization
This paper proposes an enhancement and evaluates the performance of the joint position and pitch estimation (PoPi) algorithm for speaker localization. A modification in the algorithm is introduced in order to improve the performance under high reverberation levels. The performance of the proposed method is evaluated by measuring the correct estimate of position at a frame level. This evaluation...
متن کاملFast and Robust Realtime Speaker Tracking Using Multichannel Audio and a Particle Filter
In this work a method to track the azimuth (horizontal angle) from multiple speakers in a typically reverberant real office environment is presented. The steered-response-power algorithm (SRP-PHAT) or the recently published joint position and pitch extraction approach (PoPi) combined with a sequential Monte Carlo estimation leads to a robust and fast tracker for audio indexing. One intention of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006